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Fitting Proportional Odds Models to Educational Data in Ordinal Logistic Regression 

Using Stata, SAS and SPSS 

Introduction 

The proportional odds (PO) model, which is also called cumulative odds model 
(Agresti, 1996, 2002 ; Armstrong & Sloan, 1989; Long, 1997, Long & Freese, 2006; 
McCullagh, 1980; McCullagh & Nelder, 1989; Powers & Xie, 2000; O’Connell, 2006), is 
one of the most commonly used models for the analysis of ordinal categorical data and 
comes from the class of generalized linear models. It is a generalization of a binary 
logistic regression model when the response variable has more than two ordinal 
categories. The proportional odds model is used to estimate the odds of being at or below 
a particular level of the response variable. For example, if there are j levels of ordinal 
outcomes, the model makes J-l predictions, each estimating the cumulative probabilities 
at or below the / h level of the outcome variable. This model can estimate the odds of 
being at or beyond a particular level of the response variable as well, because below and 
beyond a particular category are just two complementary directions. 

In education research, the use of ordinal categorical data is quite abundant, and 
researchers need to understand and be familiar with the ordinal logistic regression models 
dealing with the internally ordinal outcome variables. In some situations, Ordinary Least 
Squares (OLS) techniques may be used for preliminary analysis of such data by treating 
the ordinal scale variable as continuous. However, ignoring the discrete ordinal nature of 
the variable would make the analysis lose some useful information and lead to misleading 
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results. Therefore, it is crucial for researchers to use the most appropriate models to 
analyze ordinal categorical dependent variables. 

Researchers currently have a variety of options when choosing statistical software 
packages that can perform ordinal logistic regression models. For example, some general 
purpose statistical packages, such as Stata, SAS, and SPSS, all provide the options of 
analyzing proportional odds models. However, these statistical packages may use 
different techniques to estimate the ordinal logistic models. Long and Freese (2006) 
pointed out that Stata estimates cut-points in the ordinal logistic model while setting the 
intercept to be 0; other statistical software packages might estimate intercepts rather than 
cut-points. Agresti (2002) introduced both the proportional odds model the latent variable 
model, and stated that parameterization in SAS (Proc Logistic) followed the formulation 
of the proportional odds model rather than the latent variable model. Hosmer and 
Lemeshow (2000) used a formulation which was consistent with Stata’ s expression to 
define the ordinal regression model by negating the logit coefficients. Because statistical 
packages may estimate parameters in the ordinal regression model differently following 
different equations, the outputs they produce may not be the same, and thus they seem 
confusing to applied statistician and researchers. Researchers are more likely to make 
mistakes in interpreting the results if ignoring the differences in parameter estimations 
using different software packages. Therefore, it is the aim of the study to clarify the 
misunderstanding and confusion when fitting ordinal regression models. Further, no 
study has been conducted to demonstrate fitting the proportional odds model using all 
these three general-purpose statistical packages, compare differences and identify 
similarities among them. Thus, this study seeks to fill this gap. The purpose of this paper 
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is to: (1) demonstrate the use of Stata, SAS and SPSS to fit the proportional odds model 
to educational data; and (2) compare the features and results for fitting the proportional 
odds model using Stata OLOGIT, SAS PROC LOGISTIC (ascending and descending), 
and SPSS PLUM. The data of a survey instrument TPGP (Teachers’ Perceptions of 
Grading Practices ) is used to demonstrate the PO analysis. 

Theoretical Lramework 

In an ordinal logistic regression model, the outcome variable is ordered, and has 
more than two levels. Lor example, student’s SES ordered from low to high; children’s 
proficiency in early reading scored from level 0 to 5; and a response scale of a survey 
instrument ordered from strongly disagree to strongly agree. One appealing way of 
creating the ordinal variable is via categorization of an underlying continuous variable 
(Hosmer & Lemeshow, 2000). In this paper, the ordinal outcome variable is teachers’ 
teaching experience level, and it is coded as 1, 2, and 3 (1 = low; 2 = medium; and 3 = 
high). This variable is categorized based on a continuous variable, teaching years. 
Teachers with teaching experience less than five years are categorized in the low teaching 
experience level; teachers with teaching experience between 6 and 15 years are 
categorized in the medium level; and teachers with teaching experience above 15 years 
are categorized in the high level. The distribution of teaching years is very positively 
skewed. The violation of the assumption of normality makes the use of Multiple 
Regression inappropriate. Therefore, the ordinal logistic regression is the most 
appropriate model for analyzing the ordinal outcome variable here. 



A Latent-variable Model 
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The ordinal logistic regression model can be expressed as a latent variable model 
(Agresti, 2002; Greene, 2003; Long, 1997, Long & Lreese, 2006; Powers & Xie, 2000; 
Wooldridge & Jeffrey, 2001). Assuming a latent variable, Y* exists, we can define Y* = 
XP + 8, where x is a row vector (1* k) containing no constant, p is a column vector (k*l) 
of structural coefficients, and s is random error with standard normal distribution: s ~ N 
(0, 1). 

Let Y* be divided by some cut points (thresholds): ai,a 2 , 013 ... aj, and 
a 1 <a. 2 <a, 3 . . . < aj. Considering the observed teaching experience level is the ordinal 
outcome, y, ranging from 1 to 3, where 1= low, 2 = medium, and 3 = high, we define: 

1 ify*^oc 1 

Y = < 2 if a x < y* < a 2 > 

3 if a 2 < y* < 00 

Therefore, we can compute probability of a teacher at each experience level. Lor example, 
P(y=l) = P (y* <a0 = P(xp + s < aO = F (ai- xP); 

P(y=2) = P (ai<y* <af) = F (a 2 - xP)- F (ai- xP); 

P(y=3) = P (a 2 <y* < 00 ) = 1- F (a 2 - xP); 

We can also compute the cumulative probabilities using the form: P(Y<j) = F (aj - xP), 
where j = 1, 2,...J-1. (1) 

General Logistic Regression Model 

In a binary logistic regression model, the response variable has two levels, with 
l=success of the events, and 0=failure of the events. We predict the probability of 
success on a set of predictors. The logistic regression model can be expressed as: 

ln(Y') = logit [ti(x)] = In , = a + piXi + p 2 X 2 + . . . p p X p . 

^l-7l(x)J 



( 2 ) 
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In Stata, the ordinal logistic regression model is expressed in logit form as follows: 

\ 

■ =a j + (-piX 1 -p 2 X 2 - ... -ppX p) , (3) 

) 

where 7tj(x) = 7t(Y<jlxi,x 2 ,...x p ), which is the probability of being at or below category j, 
given a set of predictors, j =1, 2, . . . J -1. aj are the cut points, and Pi, p 2 ...P P are logit 
coefficients. This is the form of a Proportional Odds (PO) model because the odds ratio 
of any predictor is assumed to be constant across all categories. Similar to logistic 
regression, in the proportional odds model we work with the logit, or the natural log of 
the odds. To estimate the In (odds) of being at or below the jth category, the PO model 
can be rewritten as: 



ln(Yj') = logit [7t(x)] = In 



l-*j( x 



logit [7r(Y<j I xl,x2,...x p )] = In 



"ti(y < jlx 1 ,x 2 ,...^) > 

v 7i(Y>jlx 1 ,x 2 ,...xJ y 



a j + ( _ PiXi -p 2 X 2 - ... -PpXp) 



(4) 

Thus, this model predicts cumulative logits across J -1 response categories. By 
transforming the cumulative logits, we can obtain the estimated cumulative odds as well 
as the cumulative probabilities being at or below the / h category. 



However, SAS uses a different ordinal logit model for estimating the parameters from 
Stata. For SAS PROC LOGISTIC (the ascending option), the ordinal logit model has the 
following form: 



logit [7r(Y<j I xi, x 2 ,...x p )] = In 



"ti(y < jlx 1 ,x 2 ,...xj^ 
v 7i(Y>jlx 1 ,x 2 ,...xJ y 



a j + PiXi +p 2 X 2 + ... +P P X p ; 



( 5 ) 
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Using SAS with the descending option, the ordinal logit model can be expressed as: 



logit [7r(Y>j I xi, x 2 ,...x p )] = In 



7l( Y > jlXj.Xj,...^) 

tt(y < jl x,, J 



- C(,j + P 1 X 1 +p 2 X 2 + ... +PpX p 



(6) 

where in both equations otj are the intercepts, and Pi, p 2 ...P P are logit coefficients. 

SPSS PLUM (Polytomous Universal Model) is an extension of the generalized 
linear model for ordinal response data. It can provide five types of link functions 
including logit, probit, complementary log-log, cauchit and negative log-log. Just as Stata, 
the ordinal logit model is also based on the latent continuous outcome variable for SPSS 
PLUM, it takes the same form as follows: 



logit [7T(Y<j I xi, x 2 ,...x p )] = In 



ti(y < jlXj,X 2 ,...xJ 
7t(y > j I x , ,x 2 ,...x p ) 



a j + (-piX 1 -p 2 X 2 - ... -ppX p) , 



(7) 

where aj’s are the thresholds, and Pi, p 2 ...P P are logit coefficients; j = 1, 2. . J-l. 

Compared to both Stata and SPSS, SAS (ascending and descending) does not 
negate the signs before the logit coefficients in the equations, because SAS Logistic 
procedure (Proc Logistic) is used to model both the dichotomous and ordinal categorical 
dependent variables, and the signs before the coefficients in the ordinal logit model are 
kept consistent with those in the binary logistic regression model. Although the signs in 
the equations are positive, SAS internally changes the signs of the estimated intercepts 
and coefficients according to different ordering of the dependent variable (with the 



ascending or descending option). 
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Methodology 

Sample 

The data were collected from teachers at three middle schools and a teacher’s 
training school in Taizhou City, Jiangsu Province, China, using a survey instrument of 
Teachers’ Perceptions of Grading Practices (Liu, 2004; Liu, O’Connell & McCoach, 
2006). A total of 147 teachers responded to the survey with the response rate of 73.5%. 
The outcome variable of interest is teachers’ teaching experiences, which is an ordinal 
categorical variable with l=low, 2=medium, and 3=high. 

Explanatory variables included gender (female = 1; male = 2) and a set of scale 
scores from the survey instrument Teachers’ Perceptions of Grading Practices. The 
instrument includes five scales measuring the importance of grading, the usefulness of 
grading, student effort influencing grading, student ability influencing grading, and 
teachers’ grading habits. Composite scale scores were created by taking a mean of all the 
items for each scale. Table 1 displays the descriptive statistics for these independent 
variables. 

Table 1: Descriptive Statistics for all the Variables, n=147 



Teaching Experience Level 



Teaching 


1 


2 


3 


Total n=147 




n = 70 


n = 45 


32 


100% 




47.6% 


30.6% 


21.8% 




% gender 


74.3% 


66.7% 


50% 


66.7% 


(Female) 










import ance 


3.33 (.60) 


3.31 (.63) 


3.55 (.79) 


3.37 (.66) 
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usefulness 


3.71 (.61) 


3.38 (.82) 


3.70 (.66) 


3.60 (.70) 


effort 


3.77 (.50) 


3.79 (.46) 


3.80 (.68) 


3.78 (.53) 


ability 


3.74 (.40) 


3.75 (.54) 


3.87 (.51) 


3.77 (.47) 


habits 


3.38 (.66) 


3.57 (.66) 


3.49 (.60) 


3.46 (.65) 



Methods 

We began by fitting the proportional odds model with a single explanatory 
variable using Stata (V. 9.2) OLOGIT. Then we fitted the full-model with all six 
explanatory variables. The assumption of proportional odds for both models was 
examined using the Brant test. Additional Stata subcommands demonstrated here 
included FITSTAT and LISTCEOF of Stata SPost (Long & Freese, 2006) used for the 
analysis of post-estimations for the models. The results of fit statistics, cut points, logit 
coefficients and cumulative odds of the independent variables for both models were 
interpreted and discussed. The same model was fit using SAS (V. 9.1.3) (ascending and 
descending), and SPSS (V. 13.0), and the similarities and differences of the results using 
all three programs were compared. 



Results 

Proportional Odds Model with a Single Explanatory Variable 

A proportional odds model with a single predictor, gender was fitted first. Stata 
OLOGIT procedure was used, and the Stata syntax for ordinal logistic regression is 



shown below. 
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OLOGIT is the Stata program estimating ordinal logistic regression models of 
ordinal outcome variable on the independent variables. In this example, the outcome 
variable, teaching was followed immediately by the independent variable, gender. Figure 
1 displays the Stata output for the one -predictor proportional odds model. 

Figure 1: Stata Proportional Odds Model Example: Gender 



. ologit teaching gender 

Iteration 0: log likelihood = -153.99556 

Iteration 1: log likelihood = -151.35669 

Iteration 2: log likelihood = -151.35194 

Number of obs 
LR chi2 ( 1 ) 
Prob > chi2 
Pseudo R2 



Ordered logistic regression 
Log likelihood = -151.35194 



147 
5.29 
0.0215 
0 .0172 



teaching 


Coef . 


Std. Err. 


z 


P> 1 z | 


[95% Conf. 


Interval ] 


gender 


.7587563 


.3310069 


2.29 


0 . 022 


. 1099947 


1 .407518 


/cutl 


. 9043487 


.4678928 






-.0127044 


1 . 821402 


/cut2 


2 . 320024 


.5037074 






1 . 332775 


3 .307272 



The log likelihood ratio Chi-Square test with 1 degree of freedom, LR x 2 < n = 5.29, 
p= .0215, indicating that the logit regression coefficient of the predictor, gender is 
statistically different from 0, so the full model with one predictor provides a better fit 
than the null model with no independent variables in predicting cumulative probability 
for teaching experience level. The likelihood ratio R 2 L = .0172, which is the Pseudo R 2 , 
and is also called McFadden’s R 2 , suggesting that the relationship between the response 
variable, teaching experience, and the predictor, gender is small. More measures of fit 
were obtained when using SPost subcommand fitstat (Long & Freese, 2006). In addition 
to the deviance statistic and McFadden’s R , several other types of R statistics are 
reported (Figure2). The information measures, AIC and BIC, are used to compare either 
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nested or non-nested models. Smaller AIC and BIC statistics indicate the better fitting 
model. 

Figure 2: Measures of Fit Statistics 



. fitstat 



Measures of Fit for ologit of teaching 



Log-Lik Intercept Only: 


-153.996 


Log-Lik Full Model: 


-151.352 


D (144) : 


302 .704 


LR ( 1 ) : 


5.287 






Prob > LR: 


0.021 


McFadden's R2 : 


0.017 


McFadden's Adj R2 : 


-0.002 


ML (Cox-Snell) R2 : 


0.035 


Cragg-Uhler (Nagelkerke ) R2 : 


0.040 


McKelvey & Zavoina's R2 : 


0.038 






Variance of y* : 


3.419 


Variance of error: 


3.290 


Count R2 : 


0.476 


Adj Count R2 : 


0.000 


AIC: 


2.100 


AIC*n : 


308.704 


BIC : 


-415.918 


BIC' : 


-0.297 


BIC used by Stata: 


317 . 675 


AIC used by Stata: 


308.704 



The estimated logit regression coefficient, P = .7588. z = 2.29, p=.022, indicating 
that gender has a significant effect on teacher’s teaching experience level. Substituting 
the value of the coefficient into the formula (4), logit [7x(Y<j I gender)] = aj + (-P 1 X 1 ), 
we calculated logit [jr(Y<j I gender)] = aj - .7588 (gender). OR = e ( '' 7588) = .46 8, 
indicating that male teachers are .468 times the odds for female teachers of being at or 
below at any category, i.e., female teachers are more likely than male teachers to be at or 
below a particular category, because males are coded as 2 and girls are codes as 1. 

The results table reports two cut-points: _cutl and_cut2. These are the estimated 
cut-points on the latent variable, Y*, used to differentiate the adjacent levels of categories 
of teaching experiences. When the response category is 1, the latent variable falls at or 
below the first cut point, <*i. When the response category is 2, the latent variable falls 
between the first cut point ai and the second cut point 012 , and when the response category 
reaches 3 if the latent variable is at or beyond the second cut point 012 . 
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To estimate the cumulative odds being at or below a certain category, j for gender, 
we went back to the logit form of proportional odds model, logit [ jx( Y<j I gender)] = a,j - 
.7588 (gender). For example, when Y<1, ai. .9043 is the first cut point for the model. 
Substituting it into the formula (4), we get logit [ tx( Y<j I gender)] = .9043 - .7588 
(gender). For girls (x=l), logit [ti(Y< 1 I gender)] = .1455. By exponentiating the logit, we 
calculate the odds for female teachers of being at or below experience category 1, e 1455 = 
1.157; For male teachers (x=2), logit [ti(Y< 1 I gender)] = .9043 - .7588*2 = -.6133, so the 
odds for male teachers being at or below teaching experience category 1, e'' 6133 =.542. 
Odds ratio of male teachers versus female teachers =.542/1.157= .468. Transforming the 
cumulative odds, we can get the cumulative probabilities by using p=odds/(l+odds). 

Stata program brant is used to test the proportional odds assumption. Brant (1990) 
proposed a test of proportional odds assumption for the ordinal logistic model by 
examining the separate fits to the underlying binary logistic models. A non-significant 
omnibus test indicates that the proportional odds assumption is not violated. It also 
provides tests for each individual independent variable. When there is only one 
independent variable in the model, the results of the omnibus test and individual test are 
the same. The brant test of parallel regression assumption yields y 2 \ = .40 (p > .527), 
indicating that the proportional odds assumptions for the full-model is upheld. This 
suggests that the effect of the explanatory variable, gender, is constant across separate 
binary models fit to the cumulative cut points. The table also provides the estimated 
coefficient from j-1 binary logistic regression models. Each logistic regression model 
estimates the probability of being at or beyond teaching experience level j. 

Figure 3: Brant Test of Parallel Regression (Proportional Odds) Assumption 
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. brant, detail 

Estimated coefficients from j-1 binary regressions 

y>l y>2 

gender .66621777 .91021169 

_cons -.78882009 -2.5443422 

Brant Test of Parallel Regression Assumption 



Variable | 


chi2 


p>chi2 


df 


All | 


0 . 40 


0.527 


1 


gender | 


0 . 40 


0.527 


1 



A significant test statistic provides evidence that the parallel 
regression assumption has been violated. 



The Proportional Odds Model can also estimate the ln(odds) of being at or beyond 
category j, given a set of predictors. Again, these ln(odds) can be transformed into the 
cumulative odds, and cumulative probabilities as well. For example, we can estimate the 
cumulative probability of a teacher’s teaching experience at or beyond category 3, 
P(Y>3), at or beyond category 2, P(Y>2), and P(Y>1), which equals 1. The cumulative 
logit form can be expressed as: logit [7t(Y>j I gender)] = -Oj + (PiXi). In Stata, when 
estimating the odds of being at or beyond category j, the sign of the cut points needs to be 
reversed and their magnitude remain unchanged, since we estimate the cut points from 
the right to the left of the latent variable, Y*, that is, from the direction when Y=3 
approaches Y=l. Therefore, two cut points from right to left turn to -2.32 and - .904. 
When the predictor is dichotomous, a positive sign of the logit coefficient indicates that it 
is more likely for the group (x=l) to be at or beyond a particular category than for the 
relative group (x = 0). When the predictor is continuous, a positive coefficient indicates 
that when the value of the predictor variable increases, the probability of being at or 
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beyond a particular category increases. Using Stata syntax listcoef, we can obtain the 
odds of being at or beyond a particular category at 2.136, which is constant across all 
cumulative categories. It also indicates that male teachers are 2.136 times the odds for 
female teachers of being at or beyond any category, i.e., male teachers are more likely 
than female teachers to be at or beyond a particular category. Figure 4 displays the results 
of Stata listcoef. Adding option percent after listcoef, we can get the result of percentage 
change in odds of being at or beyond a particular category, when the predictor, gender, 
goes from males (x = 2) to females (x = 1). 

Figure 4: Results of Stata listcoef. 



. listcoef, help 

ologit (N=147) : Factor Change in Odds 
Odds of: >m vs <=m 



teaching | 


b 


z 


P> 1 z | 


e A b 


e A bStdX 


SDofX 


gender | 


0.75876 


2.292 


0 . 022 


2 . 1356 


1 . 4318 


0 .4730 



b = raw coefficient 
z = z-score for test of b=0 
P > | z | = p-value for z-test 

e^b = exp(b) = factor change in odds for unit increase in X 
e A bStdX = exp (b*SD of X) = change in odds for SD increase in X 
SDofX = standard deviation of X 



Proportional Odds Model with Six Explanatory Variables 

Next, we fit a proportional odds model with eight explanatory variables, which is 
referred to as the Full Model. Figure 5 displays the results for the fitting of the full model 
with six explanatory variables. 

Figure 5: Full-Model Analysis of Proportional Odds Using Stata 

. ologit teaching gender importance usefulness effort ability habits 
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Iteration 0 
Iteration 1 
Iteration 2 
Iteration 3 



log likelihood 
log likelihood 
log likelihood 
log likelihood 



= -153.99556 
= -147.15002 
= -147.12664 
= -147.12663 



Ordered logistic regression 
Log likelihood = -147.12663 






Number of obs = 
LR chi2 ( 6 ) = 
Prob > chi2 = 
Pseudo R2 = 


147 
13.74 
0 . 0327 
0 . 0446 


teaching 


Coef . 


Std. Err. 


z 


P> 1 z | 


[95% Conf. 


Interval ] 


gender 


. 8069529 


. 3481713 


2 . 32 


0 . 020 


. 1245496 


1.489356 


importance 


.5754665 


.3033406 


1.90 


0 . 058 


-.0190701 


1 . 170003 


usefulness 


-.634543 


.2732754 


-2 . 32 


0 . 020 


-1 . 170153 


-.098933 


effort 


. 0262542 


.3873408 


0.07 


0.946 


-.7329199 


.7854283 


ability 


.3429958 


.4158473 


0.82 


0 . 409 


-.4720499 


1 . 158042 


habits 


.3178664 


.2921743 


1.09 


0 .277 


-.2547847 


.8905175 


/cutl 


3 . 122828 


1 . 636374 






-.0844068 


6.330062 


/cut 2 


4.598343 


1 . 660188 






1 . 344434 


7 . 852251 



Before interpreting the results of the full model, we first examined the assumption 
of proportional odds. The Stata brant command provides the results of the brant test of 
parallel regression (Proportional Odds) assumption for the full model with six predictors 
and tests for each independent variable. It also provides the estimated coefficient from j-1 
binary logistic regression models results of two separate binary logistic regression models. 
The data are dichotomized according to the cumulative probability pattern so that each 
logistic regression model estimates the probability of being at or beyond teaching 
experience level j. For the omnibus brant test, y_ 2 () = 8.10, p = .230, indicating that the 
proportional odds assumptions for the full-model is upheld. Examining the brant tests for 
each individual independent variable, we find that the brant test of the assumption of 
parallel regression (proportional odds) are upheld for gender, importance, effort, ability 
and habits. For usefulness, the brant test, y 2 1 = 4.03, p = .045, which is very close to .05, 
therefore, we can also conclude that the PO assumption for this variable is almost upheld 
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Checking the estimated coefficients for each independent variable across two binary 



logistic regression models, we find the logit coefficients for all the variables were similar 



across two binary logistic models, which supports the results of the brant test of the 



proportional odds assumption. 



Figure 6: Brant Test of Parallel Regression (Proportional Odds) Assumption 



. brant, detail 






Estimated coefficients from j-1 binary regressions 




y>l 


y>2 




gender 


.74115294 


.86086025 




importance 


. 64416122 


.46874536 




usefulness 


.94566294 - 


.19259753 




effort 


.09533898 - 


. 03621639 




ability 


.26862373 


. 68349765 




habits 


.48959286 - 


. 02795948 




_cons 


2.7097459 - 


5 .7522624 




Brant Test of 


Parallel Regression Assumption 


Variable 


| chi2 


p>chi2 


df 


All 


I 8.10 


0.231 


6 


gender 


I 0.08 


0 .772 


1 


importance 


I 0.24 


0 . 622 


1 


usefulness 


! 4.03 


0 . 045 


1 


effort 


I 0.10 


0.746 


1 


ability 


I 0.66 


0 . 418 


1 


habits 


I 2.15 


0 . 142 


1 


A significant 


test statistic provides 


evidence that the parallel 


regression assumption has 


been violated. 



The log likelihood ratio Chi-Square test, LR x 2 (6) = 13.738, p= .033, indicating 
that the full model with six predictor provides a better fit than the null model with no 
independent variables in predicting cumulative probability for teaching experience. The 
likelihood ratio R l = .045, much larger than that of the gender-only model, but still small, 
suggesting that the relationship between the response variable, teaching experience, and 
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2 

six predictors, is still small. Compared with the gender-only model, all R statistics of the 
full-model shows improvement (Figure 7). 

Figure 7: Measure of Fit Statistics for Full-Model 



. fitstat 



Measures of Fit for ologit of teaching 



Log-Lik Intercept Only: 


-153.996 


Log-Lik Full Model: 


-147 . 127 


D (139) : 


294.253 


LR ( 6 ) : 


13.738 






Prob > LR: 


0.033 


McFadden's R2 : 


0.045 


McFadden's Adj R2 : 


-0.007 


ML (Cox-Snell) R2 : 


0.089 


Cragg-Uhler (Nagelkerke ) R2 : 


0 . 102 


McKelvey & Zavoina's R2 : 


0.098 






Variance of y* : 


3.646 


Variance of error: 


3.290 


Count R2 : 


0.429 


Adj Count R2 : 


-0.091 


AIC : 


2 .111 


AIC*n : 


310.253 


BIC : 


-399.417 


BIC' : 


16.205 


BIC used by Stata: 


334 . 177 


AIC used by Stata: 


310.253 



The Stata listcoef command (Figure 8) produces more detailed results of logit 



coefficients and cumulative odds (exponentiated coefficients). For the proportional odds 



model, interpretation of cumulative odds is independent on the ancillary parameters (cut 



points), because they are constant across all levels of the response variable. 



Figure 8: Results of logit coefficient, cumulative odds, and percentage change in odds 



. listcoef, help 

ologit (N=147) : Factor Change in Odds 
Odds of: >m vs <=m 



teaching 


1 


b 


z 


P> 1 z | 


e A b 


e A bStdX 


SDofX 


gender 


1 


0 . 80695 


2.318 


0 . 020 


2.2411 


1 .4648 


0 .4730 


importance 


1 


0 . 57547 


1.897 


0 . 058 


1 .7780 


1 .4601 


0 . 6578 


usefulness 


1 


-0 . 63454 


-2.322 


0.020 


0.5302 


0 . 6402 


0 . 7029 


effort 


1 


0 . 02625 


0 .068 


0 . 946 


1.0266 


1 .0140 


0.5283 


ability 


1 


0 . 34300 


0 . 825 


0 . 409 


1.4092 


1 . 1752 


0 .4707 


habits 


1 


0 . 31787 


1 . 088 


0.277 


1 . 3742 


1.2282 


0.6466 



b = raw coefficient 
z = z-score for test of b=0 
P > | z | = p-value for z-test 

e A b = exp(b) = factor change in odds for unit increase in X 
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e^bStdX = exp (b*SD of X) = change in odds for SD increase in X 
SDofX = standard deviation of X 



The effects of the independent variables can be interpreted in several ways: how 
they contribute to the odds, and then probabilities of being at or beyond a particular 
category. They can also be interpreted as how these variables contribute to the odds of 
being at or below a particular category, if we reverse the sign before the estimated logit 
coefficients and then compute corresponding cumulative odds. In terms of odds ratios, 
male teachers are 2.241 times the odds for female teachers to be at or beyond a particular 
category (OR=2.241), after controlling the effects of other predictors in the model. The 
usefulness of grading, whose corresponding OR is significantly less than 1.0, has 
significant negative effects in the model. These cumulative odds are associated with a 
teacher being in lower teaching experience categories rather than in higher categories. 

For one unit increase in the usefulness of grading, the odds ratio of being in higher 
teaching experience categories versus lower categories is .53 times lower, after 
controlling for the effects of other variables. However, variables whose corresponding 
ORs are significantly greater than 1 .0 have significant positive effects on the response 
variable in the model. For example, the importance of grading (OR=1.778) has a positive 
effect on teachers being in higher teaching experience categories. For one unit increase in 
the importance of grading, the odds ratio of being in higher teaching experience 
categories versus lower categories is 1.778 times greater, given the effects of other 
predictors are held constant. Variables such as student ability and teacher’s grading habits, 
whose corresponding ORs are greater than 1.0, but are not statistically significant, have 
positive effects on the response variable, but these effects may be due to chance and need 
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further investigation. Independent variables with ORs close to 1 .0 have no effect on the 
response variable. For example, student effort influencing grading is not associated with 
teaching experience in this model (OR=1.0266, p=.946). 

A Comparison of results of a single-variable PO model using Stata, SAS, and SPSS 

Table 2 provides a comparison of the results for Stata OLOGIT with results from 
SAS PROC LOGISTIC with the ascending and descending options, and SPSS PLUM. 
The similarities and differences between these results should be paid great attention. 
Otherwise, it would be misleading to interpret the results in the same way, disregarding 
their different parameterizations. 

In estimating proportional odds models, Stata sets the intercept to 0, and estimates 
the cut points, while SAS ascending estimates the intercepts and set the cut points to 0. 
Comparing Stata with SAS (ascending), the different choice of parameterization does not 
influence the magnitude of cut points (or intercepts) and coefficients. However, it does 
determine the sign before these estimates. 

When estimating the odds of being at or below a response category, the estimates 
for the cut points using Stata are the same as the intercepts using SAS ascending in both 
sign and magnitude. The first cut point, ai in Stata estimation is the same as the first 
intercept ao in SAS ascending estimation, because there is no first intercept ao in Stata 
estimation. Using Stata and SAS (the ascending option), the estimated logit coefficients 
are the same in magnitude but are opposite in sign. Using Stata, the estimated logit 
coefficient (]= .759. Substituting it into the logit form (4), we get logit [ 7i( Y<j I gender)] = 
a,j -(.759)*(gender) = aj -.759*(gender). OR = e ( '' 759) = .468, indicating that male 
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teachers are .468 times the odds for female teachers of being at or below at any category, 
i.e., female teachers are more likely than male teachers to be at or below a particular 
teaching experience level. Using SAS ascending, the estimated logit coefficient, P= -.759. 
Substituting it into its corresponding logit form (5), we get the same equation: logit 
[ 7x( Y <j I gender)] = oij -.759*(gender). Therefore, we can get the same results of estimated 
cumulative odds and cumulative probability using Stata and SAS ascending. 
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Table 2: Results of Proportional Odds Model with a Single Variable Using Stata, SAS 
(Ascending and descending) and SPSS: A comparison 





STATA 


SAS 

(Ascending) 


SAS 

(Descending) 


SPSS 


Model estimates 


P(Y<j) 


P(Y<j) 


P(Y>j) 


P(Y<j) 


Outpoints 


_cutl(a0 


ai = .904 


a 2 = -2.32 


ai = -.613 


(Stata)/ 


= .904 








Intercept (SAS)/ 










Threshold 










(SPSS) 












_cut2(a 2 ) = 


a 2 = 2.32 


a 2 = -.904 


a 2 = .803 




2.32 








Gender 


.759 


-.759 


.759 


0 


(male=2) 










(female = 1) 








-.759 


LR R 2 


.017 


.017 


.017 


.017 


Brant test 


X 2 i = .40 (p 








(Omnibus test) 3 


> .527) 








Score test b 




X 2 i= .4026 (p 


X 2 i = .4026 (p 


X 2 i=- 392 (p 






= .5258) 


= .5258) 


> .530) 


Model fit 


LR X 2 (i, = 5.29, 


LRx 2 u) = 5.29, 


LRx 2 (d = 5.29, 


LRX 2 (D = 




p= .0215 


p= .0215 


p= .0215 


5.287, p= .021 



a. Brant test for proportional odds assumption. 

b. Score test for proportional odds assumption. 
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Comparing the results of proportional odds model using Stata and SAS with the 
descending option, we find the estimated cut points for Stata and the estimated intercepts 
for SAS descending are the same in magnitude but are opposite in sign. Using Stat and 
SAS descending, the estimated logit coefficients are the same in both magnitude and sign. 
To estimate the odds of being at or beyond a particular teaching experience level using 
Stata, we just need to reverse the sign of the estimated cut points. The estimated logit 
coefficient, (]=.759. Exponentiating it, we get e ( 759) =2.136, indicating male teachers are 
2.136 times greater than female teachers to be at or beyond a particular category. In other 
words, female teachers are less likely than male teachers to be at or beyond a certain 
category. 

Using Stata and SPSS, when estimating the effects of predictors on the log odds 
of being at or below a certain category of the outcome variable, the sign before the 
coefficients are both the “minus” rather than the “plus”. In other words, the effects of 
predictors are subtracted from the cut points or thresholds. SPSS PLUM labels the 
estimated logits for the predicator variables LOCATION. When the predicator variable is 
a continuous variable, the estimated logit coefficients are the same as those estimated by 
Stata OLOGIT in both magnitude and sign. However, SPSS PLUM is different from 
Stata OLOGIT in this aspect: when the predictor variable is categorical, for example 
gender, with l=female and 2=male, the estimated coefficient is only displayed for the 
category with smaller value, i.e., when gender =1. The category with larger value, 
gender=2, is the reference category, and has an estimate of 0. If we code gender with 
l=female and 0=male, the estimated coefficient is displayed for the case when gender=0. 
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and the estimated coefficient for female (gender =1) is 0. Using SPSS PLUM, the 
estimated logit coefficient, P= -.759 for the case when female =1, and [:S= 0 for the case 
when male = 2. Substituting it into the logit form (7), we get logit [7t(Y<j I gender)] = 
a,j -(-.759)*(gender) = a,j +.759*(gender). By exponentiating it, we get OR = e ( ' 759) 
=2.136, indicating that female teachers are 2.136 times the odds for male teachers of 
being at or below at a particular teaching experience level. This result is equivalent to that 
of Stata. 

The parameter estimation for the categorical predictor in SPSS PLUM makes the 
threshold values in the ordinal logit model different from those estimated by Stata 
OLOGIT. These differences can be seen in the results of the proportional odds model 
using Stata, SAS (ascending and descending), and SPSS (Table 2). In SPSS PLUM, the 
threshold estimates are for the case when gender = 2 (male teachers), while in Stata and 
SAS, the cut points or intercepts are for case when gender = 1 (female students). In the 
following we show how the equivalent results of estimated logit can be obtained using 
different estimates of cutpoints (thresholds) and logit coefficients fitted by Stata and 
SPSS. Lor example, using SPSS, the predicted logit for male teachers (gender =2) of 
being at or below teaching experience level 1, logit [ti( Y<1 1 gender)] = ai -0*(gender) 

= -.613 +0*(2) = -.613; the predicted logit for female teachers (gender =1) of being at or 
below teaching experience level 1, logit [ 7x( Y< 1 I gender)] = ai -(-.759)*(gender) = - 
.613 + .759*1 = .146. Using Stata, the predicted logit for male teachers (gender =2) of 
being at or below teaching experience level 1, logit [7 t(Y< 1I gender)] 

= ai -(.7 5 9)* (gender) = .904 -.759*2 = -.614; the predicted logit for female teachers 
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(gender =1) of being at or below teaching experience level 1, logit [ti(Y< 1 I gender)] = 
ai -(.759)*(gender) = .904 - .759*1 = .145. 

To test the proportional odds assumption, Stata uses Brant test of parallel 
regression assumption with the result % 2 i = .40 p > .527; SAS ascending and descending 
use score test and have the same results % 2 1 = .4026, p = .5258; SPSS uses test of parallel 
lines with the result = .392, p > .530. All tests produce similar results that the 
proportional odds model assumption is upheld. Across four models, the omnibus 
likelihood ratio tests produce the same results, indicating the proportional odds model 
with one variable gender fit better than the null model. 

Features of the ordinal logistic regression analysis using Stata, SAS and SPSS 
were listed and compared in Table 3. 
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Table 3: Feature Comparisons of the Ordinal Logistic Regression Analysis Using Stata, SAS and SPSS 





STATA 


SAS 


SPSS 


Model Specification 








Outpoints/ thresholds 


V 




V 


Intercept 




V 




Test hypotheses of logit coefficients 


V 


V 


V 


Maximum Likelihood Estimates 








Odds Ratio 


V 


V 




z-statistic or wald test for Parameter Estimate 


V 




V 


Chi-square Statistic for Parameter Estimate 




V 




Confidence Interval for Parameter Estimate 


V 




V 


Fit Statistics 








Loglikelihood 


V 


V 


V 


Goodness-of-fit Test 






V 


Pseudo R-Square 


V 




V 


Test ofPO Assumption 








Omnibus Test of Assumption of Proportional Odds 


V 


V 


V 


Test of Assumption of Proportional Odds for individual variable 


V 






Association of Predicted Probabilities and Observed Responses 




V 
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Conclusions 

In this paper, the use of proportional odds models was illustrated to predict 
teachers’ teaching experience level from a set of measures of teachers’ perceptions of 
grading practices. A single independent variable model and a full-model with six 
independent variables were fitted and compared. The assumptions of proportional odds 
for both models were examined. It was found that the assumption of proportional odds 
for both the single-variable model and the full-model was upheld. 

Results from the proportional odds model revealed that the usefulness of grading 
had a negative effect on the prediction of teaching experience level (OR=.53), while the 
importance of grading practices had a positive effect on the experience level (OR=1.78), 
after controlling for the effects of other variables. Although student effort influencing 
teachers’ grading practices (OR = 1.41) and teachers’ grading habits (OR = 1.37) had 
positive effects on teaching experience level, these effects are not found to be significant. 
Compared to male teachers, female teachers were more likely to be at or below a 
particular category, or in other words, males are more likely to be at or beyond an 
experience level. Student effort influencing grading was not associated with teachers’ 
teaching experience level in the model. These findings suggest that teachers with longer 
teaching experiences tended to feel the grading practices are more important than the 
teachers with fewer years of teaching. However, teachers with longer teaching 
experiences tended to doubt the usefulness of grading in their teaching. This may be due 
to the reason they were forced to conduct test-oriented teaching in China. In addition, the 
gender difference suggests that female teachers were easier to be categorized as 



inexperienced teachers. This was because more and more female students received the 
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opportunities of higher normal education, and chose being teachers as their profession in 
the recent years. The frequencies of new female teachers are more than those of new male 
teachers in current China. 

Comparing the results using Stata and SAS, we found that both packages 
produced the same or similar results in model fit statistics, and the test of proportional 
odds assumption. The estimated coefficients and cut points (thresholds) were the same in 
magnitude but may be reversed in sign. Comparing the results using Stata and SPSS, we 
found that although the ordinal logit models are based on latent continuous response 
variables for both packages, SPSS PLUM estimated the logit coefficient for the category 
with smaller value when the predictor variable was categorical, and thus the estimated 
thresholds were different from those estimated by Stata. Researchers should understand 
the differences of parameterization of ordinal logistic models using Stata and other 
statistical packages. Researchers should pay attention to the sign before the estimated 
logit coefficients and the cut points in the model, and be cautious in interpreting the 
results. 

When analyzing ordinal logistic regression models, the role of any statistical 
software package is a tool for researchers. The choice of any one is the preference of 
researchers, and it is not the purpose of the study to suggest which one is the best for 
ordinal logistic regression analysis. This demonstration clarifies some of the issues that 
researchers must consider in using different statistical packages for analysis of ordinal 



data. 
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